Abstract

In this paper we establish an estimate for the rate of convergence of 
the Krasnosel'skii-Mann iteration for computing fixed points of non- 
expansive maps. Our main result settles the Baillon-Bruck conjecture 
[3] on the asymptotic regularity of this iteration. The proof proceeds 
by establishing a connection between these iterates and a stochastic 
process involving sums of non-homogeneous Bernoulli trials. We also 
exploit a new Hoeffding-type inequality to majorize the expected value 
of a convex function of these sums using Poisson distributions. 
1 Introduction

Let $T : C \to C$ be a non-expansive map defined on a convex subset $C \subseteq X$ of
a normed space $(X, \|\cdot\|)$. The Krasnosel'skii-Mann iteration for computing
a fixed point of $T$ is defined by (cf. [22, 23])

$$x_k = (1 - \alpha_k)\, x_{k-1} + \alpha_k\, T x_{k-1} \tag{1}$$

with $x_0 \in C$ given and $\alpha_k \in [0, 1]$.
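As an illustration of iteration (1), here is a minimal Python sketch. The helper names `km_iterates` and `rotate90` are hypothetical and not part of the paper; the map used is a 90-degree rotation of the plane, chosen only because it is a simple isometry with a single fixed point at the origin.

```python
from typing import Callable, List, Sequence

Vector = List[float]


def km_iterates(T: Callable[[Vector], Vector], x0: Vector,
                alphas: Sequence[float]) -> List[Vector]:
    """Return [x_0, ..., x_n] with x_k = (1 - a_k) x_{k-1} + a_k T(x_{k-1})."""
    xs = [list(x0)]
    for a in alphas:
        x = xs[-1]
        Tx = T(x)
        xs.append([(1.0 - a) * xi + a * ti for xi, ti in zip(x, Tx)])
    return xs


def rotate90(x: Vector) -> Vector:
    """Illustrative non-expansive map: rotation by 90 degrees, Fix(T) = {0}."""
    return [-x[1], x[0]]


# Constant relaxation alpha_k = 1/2, started from x_0 = (1, 0).
iterates = km_iterates(rotate90, [1.0, 0.0], [0.5] * 50)
```

For this particular map each step with $\alpha_k = \frac12$ shrinks the norm by a factor $1/\sqrt{2}$, so the iterates converge to the fixed point even though $T$ itself is not a contraction.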

Strong convergence of $x_k$ to a fixed point was proved in [22, Krasnosel'skii]
for $\alpha_k \equiv \frac{1}{2}$, when $X$ is a uniformly convex Banach space and $T(C)$ is
contained in a compact subset of $C$. This result was extended to $\alpha_k \equiv \alpha$ in
[28, Schaefer] and to $X$ strictly convex in [Edelstein], while [17, Ishikawa]
proved it for general Banach spaces with $\alpha_k$ bounded away from 1 and
$\sum \alpha_k = \infty$. The Banach case with $\alpha_k \equiv \alpha$ was also considered in
[10, Edelstein and O'Brien].
Without the compactness assumption, weak convergence was established in
[25, Reich] assuming $\sum \alpha_k(1-\alpha_k) = \infty$ and $\mathrm{Fix}(T) \neq \emptyset$, for $X$ uniformly
convex with a Fréchet differentiable norm. Although strong convergence does
not hold in general (see [12, Genel and Lindenstrauss] and [Bauschke et
al.]), it does occur for most operators in the sense of Baire's categories (see
[27, Reich and Zaslavski]).

The crucial step in proving the convergence of the iterates in all these
results is to show that $\|x_n - Tx_n\|$ tends to 0, a property which is now known
as asymptotic regularity (see, e.g., [26]). Under various assumptions, asymptotic
regularity was also proved in [15, Groetsch] and [13, Goebel and Kirk]. The
latter noted a certain uniformity in the convergence, namely, for each $\epsilon > 0$
we have $\|x_n - Tx_n\| \le \epsilon$ for all $n \ge n_0$, with $n_0$ depending on $\epsilon$ and $C$ but
independent of the initial point $x_0$ and the map $T$. More recently, using proof
mining techniques, Kohlenbach [20, 21] showed that $n_0$ could be chosen to
depend on $C$ only through its diameter. An explicit metric estimate which
readily implies all these results was stated in [3, Baillon and Bruck], namely,
they conjectured the existence of a universal constant $\kappa$ such that

$$\|x_n - Tx_n\| \le \kappa\, \frac{\mathrm{diam}(C)}{\sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)}} \tag{2}$$

and proved it for the case $\alpha_n \equiv \alpha$ with $\kappa = 1/\sqrt{\pi}$.

In this paper we settle this conjecture by proving that the bound holds in
general with $\kappa = 1/\sqrt{\pi}$ for any sequence $\alpha_k$ and each non-expansive $T : C \to C$.
Although we do not know whether this is the smallest possible $\kappa$, we provide
an example which shows that it cannot be improved by more than 17%. We
also discuss how the result can be used to analyze the convergence of (1),
and how it applies when $C$ is unbounded but $\mathrm{Fix}(T) \neq \emptyset$.

Our proof is based on a recursive bound for the distances between the
iterates $\|x_m - x_n\| \le c_{mn}$, where $c_{mn}$ admits a nice probabilistic interpretation
in terms of a random walk on $\mathbb{Z}$. In proving the theorem we exploit some
properties of the hypergeometric and modified Bessel functions, as well as a
known identity for Catalan numbers. We also use the following Hoeffding-type
inequality which might be of interest on its own: if $S = X_1 + \dots + X_m$
is a sum of independent Bernoullis and $Z$ is a Poisson with the same mean
$E(Z) = E(S)$, then $E[g(S)] \le E[g(Z)]$ for every convex function $g : \mathbb{N} \to \mathbb{R}$.

2 Main result 

Theorem 1. The Krasnosel'skii-Mann iterates generated by (1) satisfy

$$\|x_n - Tx_n\| \le \frac{\mathrm{diam}(C)}{\sqrt{\pi \sum_{i=1}^n \alpha_i(1-\alpha_i)}}. \tag{3}$$

The proof is split into several intermediate steps. Note that by rescaling
the norm, we may assume $\mathrm{diam}(C) = 1$.

2.1 A recursive bound 

Let $\rho_k = \prod_{j=1}^k (1-\alpha_j)$ and $\pi_k^n = \frac{\rho_n}{\rho_k}\alpha_k = \alpha_k \prod_{j=k+1}^n (1-\alpha_j)$. By convention we
also set $\rho_0 = \alpha_0 = 1$, while the term $Tx_{-1}$ is interpreted as $x_0$.

Proposition 2. For $n \ge 0$ we have $x_n = \sum_{k=0}^n \pi_k^n\, Tx_{k-1}$ and

$$x_m - x_n = \sum_{j=0}^m \sum_{k=m+1}^n \pi_j^m \pi_k^n\, [Tx_{j-1} - Tx_{k-1}] \quad \text{for } 0 \le m \le n. \tag{4}$$



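Proof. Dividing (1) by $\rho_k$ we have $\frac{x_k}{\rho_k} = \frac{x_{k-1}}{\rho_{k-1}} + \frac{\alpha_k}{\rho_k} Tx_{k-1}$ which, when iterated,
yields $\frac{x_n}{\rho_n} = x_0 + \sum_{k=1}^n \frac{\alpha_k}{\rho_k} Tx_{k-1}$. Using the conventions $\rho_0 = \alpha_0 = 1$ and
$x_0 = Tx_{-1}$ we get precisely $x_n = \sum_{k=0}^n \pi_k^n\, Tx_{k-1}$. This equality, combined
with the identities $\sum_{j=0}^m \pi_j^m = 1$ and $\pi_k^m - \pi_k^n = \pi_k^m \sum_{j=m+1}^n \pi_j^n$ yields

$$\begin{aligned}
x_m - x_n &= \sum_{k=0}^m (\pi_k^m - \pi_k^n)\, Tx_{k-1} - \sum_{k=m+1}^n \pi_k^n\, Tx_{k-1} \\
&= \sum_{k=0}^m \sum_{j=m+1}^n \pi_k^m \pi_j^n\, Tx_{k-1} - \sum_{j=0}^m \sum_{k=m+1}^n \pi_j^m \pi_k^n\, Tx_{k-1}
\end{aligned}$$

so that exchanging $j$ and $k$ in the first double sum we obtain (4). □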

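Corollary 3. Define $c_{mn}$ recursively by setting $c_{-1,n} = 1$ for all $n \ge 0$ and

$$\text{(R)} \qquad c_{mn} = \sum_{j=0}^m \sum_{k=m+1}^n \pi_j^m \pi_k^n\, c_{j-1,k-1} \quad \text{for } 0 \le m \le n.$$

Then $\|x_m - x_n\| \le c_{mn}$ for all $0 \le m \le n$.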

Proof. The proof is by induction on $n$. Suppose that $\|x_j - x_k\| \le c_{jk}$ holds
for all $0 \le j \le k \le n-1$. Using the triangle inequality in (4) we get

$$\|x_m - x_n\| \le \sum_{j=0}^m \sum_{k=m+1}^n \pi_j^m \pi_k^n\, \|Tx_{j-1} - Tx_{k-1}\|. \tag{5}$$

The induction hypothesis gives $\|Tx_{j-1} - Tx_{k-1}\| \le \|x_{j-1} - x_{k-1}\| \le c_{j-1,k-1}$
for $1 \le j \le k$, while for $j = 0$ we have $\|Tx_{-1} - Tx_{k-1}\| = \|x_0 - Tx_{k-1}\| \le
\mathrm{diam}(C) = 1 = c_{-1,k-1}$. Plugging these bounds into (5) and using (R) we
deduce $\|x_m - x_n\| \le c_{mn}$, completing the induction step. □

Note that for $m = n$ we have $c_{nn} = 0$ and the inequality $\|x_n - x_n\| \le c_{nn}$
holds trivially. More interestingly, since $\|x_n - x_{n+1}\| = \alpha_{n+1}\|x_n - Tx_n\|$ we
have $\|x_n - Tx_n\| \le \frac{c_{n,n+1}}{\alpha_{n+1}} =: P_n$ so that Theorem 1 will follow by showing

$$\sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)}\; P_n \le \frac{1}{\sqrt{\pi}}. \tag{6}$$

Our analysis proves that this bound is sharp, so that $\frac{1}{\sqrt{\pi}}$ is the best constant
one can get from Corollary 3. This does not exclude the possibility that
other techniques might lead to sharper bounds in Theorem 1 (cf. [2, Baillon
and Bruck]).
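The recursion (R) and the bound (6) can be explored numerically. The following Python sketch is only an illustration: the names `make_c` and `P` are hypothetical, `alphas[k-1]` is assumed to hold $\alpha_k$ for $k \ge 1$, and the convention $\alpha_0 = 1$ is hard-coded.

```python
from functools import lru_cache
from math import pi, sqrt
from typing import Callable, Sequence


def make_c(alphas: Sequence[float]) -> Callable[[int, int], float]:
    """Build c(m, n) from recursion (R); alphas[k-1] holds alpha_k, k >= 1."""
    def alpha(k: int) -> float:
        return 1.0 if k == 0 else alphas[k - 1]   # convention alpha_0 = 1

    def weight(k: int, n: int) -> float:
        """pi_k^n = alpha_k * prod_{j=k+1}^n (1 - alpha_j)."""
        w = alpha(k)
        for j in range(k + 1, n + 1):
            w *= 1.0 - alpha(j)
        return w

    @lru_cache(maxsize=None)
    def c(m: int, n: int) -> float:
        if m == -1:
            return 1.0                            # boundary case c_{-1,n} = 1
        return sum(weight(j, m) * weight(k, n) * c(j - 1, k - 1)
                   for j in range(m + 1) for k in range(m + 1, n + 1))

    return c


def P(alphas: Sequence[float], n: int) -> float:
    """P_n = c_{n,n+1} / alpha_{n+1}; requires alphas[n] = alpha_{n+1} > 0."""
    return make_c(alphas)(n, n + 1) / alphas[n]


# Left-hand side of (6) for an arbitrary illustrative sequence.
alphas = [0.3, 0.7, 0.5, 0.2, 0.9, 0.4, 0.6]
lhs = [sqrt(sum(a * (1 - a) for a in alphas[:n])) * P(alphas, n)
       for n in range(1, len(alphas))]
```

Every entry of `lhs` should stay below $1/\sqrt{\pi} \approx 0.5642$. The direct evaluation is polynomial in $n$ but not optimized, so it is meant for small $n$ only.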

2.2 Fox-and-Hare race and a random walk 

The recurrence (R) has a probabilistic interpretation. Consider a fox at
position $n$ trying to catch a hare located at $m < n$. At each integer $i \in \mathbb{N}$
the fox must jump over a hurdle to reach $i - 1$. The jump succeeds with
probability $(1 - \alpha_i)$ in which case the process repeats, otherwise the fox falls
at $i - 1$ where it rests to recover from injuries. Thus, starting from $n$ the
probability of landing at $k - 1$ is precisely $\pi_k^n$. The fox catches the hare if it
jumps successfully down to $m$ or below. Otherwise, the hare runs toward the
burrow located at $-1$ by following the same rules. The process alternates
until either the fox catches the hare, or the hare reaches the burrow.

The recurrence (R) satisfied by $c_{mn}$ characterizes precisely the probability
for the hare to reach the burrow safely when the process starts at $(m, n)$. This
is also consistent with the boundary cases $c_{-1,n} = 1$ and $c_{nn} = 0$. Note that
$\alpha_0 = 1$ so at $i = 0$ both the fox and hare fall with certainty, landing at $-1$.
From this interpretation we get the following expression for $c_{mn}$.

Proposition 4. Let $(F_i)_{i \in \mathbb{N}}$ and $(H_i)_{i \in \mathbb{N}}$ denote independent Bernoulli trials
representing respectively the events that the fox and hare fail at the $i$-th hurdle,
so that $P(F_i = 1) = P(H_i = 1) = \alpha_i$. Then

$$c_{mn} = P\Big(\sum_{i=k}^n F_i > \sum_{i=k}^m H_i \ \text{ for all } k = m+1, \dots, 1\Big). \tag{7}$$

In particular, denoting $Z_i = F_i - H_i$ we have

$$P_n = \frac{c_{n,n+1}}{\alpha_{n+1}} = P\Big(\sum_{i=k}^n Z_i \ge 0 \ \text{ for } k = n, \dots, 1\Big). \tag{8}$$

Proof. Formula (7) is just a restatement of the fact that the hare wins iff the
number of times the fox falls in any interval $\{k, \dots, n\}$ is strictly larger than
the number of falls of the hare in $\{k, \dots, m\}$. The expression for $P_n$ follows
by noting that the event corresponding to $c_{n,n+1}$ in (7) requires $F_{n+1} = 1$
(take $k = n + 1$). □

Formula (8) has an alternative interpretation. Let $p_i = 2\alpha_i(1-\alpha_i)$ so that
$Z_i$ takes values in $\{-1, 0, 1\}$ with probabilities $\frac{p_i}{2}, 1-p_i, \frac{p_i}{2}$. The sums $\sum_{i=k}^n Z_i$
taken in reverse order $k = n, \dots, 1$ define a random walk on $\mathbb{Z}$ where at each
stage the process stays at the current position with some probability, and
otherwise moves left or right with equal probability as in a standard random
walk. Hence, $P_n$ is the probability that the walk remains non-negative over
$n$ stages. Conditioning on the total number of stages at which the process
effectively moves, this is also the probability that a standard random walk
stays non-negative over a random number of stages. Using this interpretation
we get the following more explicit formula.
Proposition 5. Let $M = M_1 + \dots + M_n$ be a sum of independent Bernoullis
with success probabilities $P(M_i = 1) = p_i = 2\alpha_i(1-\alpha_i)$ and consider the integer
function $F(m) = \binom{m}{\lfloor m/2 \rfloor} 2^{-m}$. Then $P_n = E[F(M)]$.

Proof. The variable $M_i$ can be interpreted as move/stay and $Z_i$ can be
expressed as $Z_i = M_i D_i$ with $D_i$ independent variables representing the
direction of the movement: $P(D_i = -1) = P(D_i = 1) = \frac{1}{2}$. Conditioning on the
sum $M$ and using the exchangeability of the variables $D_i$ we obtain

$$\begin{aligned}
P_n &= \sum_{m=0}^n P\Big(\sum_{i=k}^n M_i D_i \ge 0 \ \text{ for } k = n, \dots, 1 \,\Big|\, M = m\Big)\, P(M = m) \\
&= \sum_{m=0}^n P\Big(\sum_{i=1}^{\ell} D_i \ge 0 \ \text{ for } \ell = 1, \dots, m\Big)\, P(M = m).
\end{aligned}$$

The expression $P(\sum_{i=1}^{\ell} D_i \ge 0 \text{ for } \ell = 1, \dots, m)$ is the probability that a
standard random walk started from 0 remains non-negative over $m$ stages.
Its value is precisely $F(m)$ [11, Ch. III.3] so the conclusion follows. □
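The formula $P_n = E[F(M)]$ of Proposition 5 is straightforward to evaluate. The sketch below is an illustration only; the function names are hypothetical, and the distribution of $M$ is obtained by plain convolution of the Bernoulli laws.

```python
from math import comb
from typing import List, Sequence


def F(m: int) -> float:
    """F(m) = C(m, floor(m/2)) / 2^m."""
    return comb(m, m // 2) / 2 ** m


def poisson_binomial_pmf(probs: Sequence[float]) -> List[float]:
    """pmf of M = M_1 + ... + M_n with P(M_i = 1) = probs[i], by convolution."""
    pmf = [1.0]
    for p in probs:
        nxt = [0.0] * (len(pmf) + 1)
        for m, q in enumerate(pmf):
            nxt[m] += (1.0 - p) * q      # M_i = 0: the walk stays
            nxt[m + 1] += p * q          # M_i = 1: the walk moves
        pmf = nxt
    return pmf


def P_n(alphas: Sequence[float]) -> float:
    """E[F(M)] with p_i = 2 alpha_i (1 - alpha_i), as in Proposition 5."""
    probs = [2.0 * a * (1.0 - a) for a in alphas]
    return sum(F(m) * q for m, q in enumerate(poisson_binomial_pmf(probs)))
```

For a single step this reduces to $P_1 = 1 - \alpha_1(1-\alpha_1)$, which agrees with what recursion (R) gives for $c_{1,2}/\alpha_2$.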

The next result establishes an alternative recursion satisfied by $c_{mn}$. This
is not used in our proof, but we state it in case someone could use it to find a
simpler proof of Theorem 1.

Proposition 6. Denoting $\beta_k = 1 - \alpha_k$, we have the recurrence

$$c_{mn} = \beta_m c_{m-1,n} + \beta_n c_{m,n-1} + (\alpha_m \alpha_n - \beta_m \beta_n)\, c_{m-1,n-1}. \tag{9}$$

Proof. Denote $w_{jk} = \pi_j^m \pi_k^n c_{j-1,k-1}$ and let $S = A + B - C - D$ with

$$\begin{aligned}
A &= c_{mn} = \sum_{j=0}^{m} \sum_{k=m+1}^{n} w_{jk} \\
B &= \beta_m \beta_n c_{m-1,n-1} = \sum_{j=0}^{m-1} \sum_{k=m}^{n-1} w_{jk} \\
C &= \beta_m c_{m-1,n} = \sum_{j=0}^{m-1} \sum_{k=m}^{n} w_{jk} \\
D &= \beta_n c_{m,n-1} = \sum_{j=0}^{m} \sum_{k=m+1}^{n-1} w_{jk}.
\end{aligned}$$

Canceling out the common terms we get $S = w_{mn} = \alpha_m \alpha_n c_{m-1,n-1}$ which is
exactly (9). □

2.3 A sharp upper bound 

From Proposition 5 the bound (6) is equivalent to showing that

$$R_n(p) = \sqrt{p_1 + \dots + p_n}\; E[F(M_1 + \dots + M_n)] \le \sqrt{\tfrac{2}{\pi}}$$

for all $n$ and $0 \le p_i \le \frac{1}{2}$. The function $R_n(p)$ is strictly concave in each
variable $p_i$ separately, so the maximum is attained at the extreme values $0, \frac{1}{2}$
or at a unique point in $(0, \frac{1}{2})$. Interestingly, all non-extreme coordinates may
be taken equal.

Lemma 7. $R_n(p)$ is maximal when $p_i \in \{0, u, \frac{1}{2}\}$ for some $0 < u < \frac{1}{2}$.

Proof. Let $p$ maximize $R_n(p)$ and suppose $p_j = x$ and $p_k = y$ with $x, y \in (0, \frac{1}{2})$
and $x \neq y$. Let $h(\ell) = E[F(\ell + S)]$ where $S = \sum_{i \neq j,k} M_i$ so that

$$\begin{aligned}
P_n &= (1-x)(1-y)\,h(0) + [x(1-y) + y(1-x)]\,h(1) + xy\,h(2) \\
&= a + b(x + y) + cxy
\end{aligned}$$

with $a = h(0)$, $b = h(1) - h(0)$ and $c = h(0) + h(2) - 2h(1)$. Setting $m = \sum_{i \neq j,k} p_i$
it follows that $x, y \in (0, \frac{1}{2})$ maximize the expression

$$\sqrt{m + x + y}\; [a + b(x + y) + cxy].$$

Setting the partial derivatives to 0 we get $cx = cy$ and since $x \neq y$ it follows
that $c = 0$. But then, the function depends only on the sum $x + y$ and we
may change these coordinates to $x + \epsilon$, $y - \epsilon$ keeping the same value, until
one of them hits an extreme value: either $x + \epsilon = \frac{1}{2}$ or $y - \epsilon = 0$. This yields
a new optimal $p$ with one coordinate less in $(0, \frac{1}{2})$. Repeating this process
we get an optimal $p$ whose coordinates take at most one value in $(0, \frac{1}{2})$. □

According to this Lemma, in order to bound $R_n(p)$ it suffices to consider
the case $p_i \in \{0, u, \frac{1}{2}\}$ with $0 < u < \frac{1}{2}$. Moreover, by changing $n$ we may
ignore the deterministic variables with $p_i = 0$. We distinguish two cases.

2.3.1 All coordinates $p_i = u$

In this case $R_n(p) = \sqrt{nu}\, E[F(S)]$ with $S \sim B(n,u)$ Binomial. This case
follows from the results in [3, Baillon and Bruck] which were obtained using a
computer generated proof. Here we provide a direct proof based on a known
identity for Catalan numbers.

Proposition 8. Let $S \sim B(n, u)$ with $0 < u < \frac{1}{2}$. Then

$$E[F(S)] = \sum_{k=0}^n \binom{n}{k} \frac{1}{k+1} \binom{2k}{k} \Big(-\frac{u}{2}\Big)^k \tag{10}$$

and $R_n(p) = \sqrt{nu}\, E[F(S)]$ increases with $n$ towards $\sqrt{\frac{2}{\pi}}$.
Proof. Using the Binomial theorem, a straightforward computation gives

$$\begin{aligned}
E[F(S)] &= \sum_{j=0}^n \sum_{k=j}^n (-1)^j F(j) \binom{n}{j} \binom{n-j}{k-j} (-u)^k \\
&= \sum_{k=0}^n \binom{n}{k} (-u)^k \sum_{j=0}^k (-1)^j \binom{k}{j} F(j)
\end{aligned} \tag{11}$$

where the last equality follows from the identity $\binom{n}{j}\binom{n-j}{k-j} = \binom{n}{k}\binom{k}{j}$ and
exchanging the order of the sums. The last inner sum may be computed from
a known identity for Catalan numbers $C_k = \frac{1}{k+1}\binom{2k}{k}$, namely (see footnote 1)

$$C_k = \sum_{j=0}^k (-1)^j\, 2^{k-j} \binom{k}{j} \binom{j}{\lfloor j/2 \rfloor} = 2^k \sum_{j=0}^k (-1)^j \binom{k}{j} F(j)$$

which when substituted into (11) yields (10).

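By direct verification, the expression on the right of (10) is the
hypergeometric function ${}_2F_1(-n, \frac{1}{2}; 2; 2u)$, whose Euler integral representation gives

$$E[F(S)] = \frac{2}{\pi} \int_0^1 t^{-1/2} (1-t)^{1/2} (1 - 2ut)^n \, dt.$$

Multiplying by $\sqrt{nu}$ and using the change of variables $s = 2nut$ we get

$$R_n(p) = \frac{\sqrt{2}}{\pi} \int_0^{2nu} \frac{1}{\sqrt{s}} \sqrt{1 - \frac{s}{2nu}}\, \Big(1 - \frac{s}{n}\Big)^n ds$$

which increases with $n$ towards the limit $\frac{\sqrt{2}}{\pi} \int_0^\infty \frac{1}{\sqrt{s}} e^{-s}\, ds = \frac{\sqrt{2}}{\pi}\, \Gamma(\frac{1}{2}) = \sqrt{\frac{2}{\pi}}$. □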
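Formula (10) and the monotone convergence of $R_n(p)$ can be checked with exact rational arithmetic. This is a small sketch for illustration; the function names are hypothetical, and `fractions.Fraction` is used only to avoid cancellation errors in the alternating sum.

```python
from fractions import Fraction
from math import comb, pi, sqrt


def expected_F_binomial(n: int, u: Fraction) -> Fraction:
    """Right-hand side of (10): sum_k C(n,k) * Catalan(k) * (-u/2)^k."""
    total = Fraction(0)
    for k in range(n + 1):
        catalan = comb(2 * k, k) // (k + 1)
        total += comb(n, k) * catalan * (-u / 2) ** k
    return total


def expected_F_direct(n: int, u: Fraction) -> Fraction:
    """E[F(S)] for S ~ B(n, u), summed directly over the binomial pmf."""
    return sum(comb(n, j) * u ** j * (1 - u) ** (n - j)
               * Fraction(comb(j, j // 2), 2 ** j) for j in range(n + 1))


def R(n: int, u: Fraction) -> float:
    """R_n(p) = sqrt(n u) E[F(S)] in the case where all p_i equal u."""
    return sqrt(n * u) * float(expected_F_binomial(n, u))


values = [R(n, Fraction(1, 3)) for n in range(1, 41)]
```

The list `values` should be increasing and bounded by $\sqrt{2/\pi} \approx 0.7979$, in line with Proposition 8.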



2.3.2 At least one coordinate $p_i = \frac{1}{2}$

With no loss of generality assume $p_1 = \frac{1}{2}$ and denote $S = M_2 + \dots + M_n$.
Conditioning on $M_1$ and setting $g(k) = \frac{1}{2}[F(k) + F(k+1)]$ we get

$$E[F(M_1 + \dots + M_n)] = E[g(S)].$$

A direct calculation shows that $g : \mathbb{N} \to \mathbb{R}$ is convex, namely

$$g(k) \le \tfrac{1}{2}[g(k-1) + g(k+1)] \quad \text{for all } k \ge 1,$$

so we may use the Hoeffding-type inequality in Proposition 12 to obtain
$E[g(S)] \le E[g(Z)]$ with $Z \sim P(z)$ a Poisson variable with $z = p_2 + \dots + p_n$.
From this it follows that

$$\begin{aligned}
R_n(p) &\le \sqrt{z + \tfrac{1}{2}}\; E[g(Z)] \\
&= \sqrt{z + \tfrac{1}{2}}\, \exp(-z) \Big[ I_0(z) + \Big(1 - \frac{1}{2z}\Big) I_1(z) \Big]
\end{aligned} \tag{12}$$

where $I_0(z) = \sum_{k=0}^\infty \frac{1}{(k!)^2} (\frac{z}{2})^{2k}$ and $I_1(z) = \sum_{k=0}^\infty \frac{1}{k!(k+1)!} (\frac{z}{2})^{2k+1}$ are modified
Bessel functions.

1 See http://mathworld.wolfram.com/CatalanNumber.html. A proof is also given in
Section 4.2.

Proposition 9. Let $h(z)$ denote the expression in (12). Then $h(z)$ is
increasing with $h(z) \le \lim_{z \to \infty} h(z) = \sqrt{\frac{2}{\pi}}$.

Proof. The identities $I_0'(z) = I_1(z)$ and $I_1'(z) = I_0(z) - \frac{1}{z} I_1(z)$ imply

$$h'(z) = \frac{e^{-z}}{4 z^2 \sqrt{z + 1/2}}\, \big[ 2(1+z) I_1(z) - z I_0(z) \big]$$

so that proving that $h$ is increasing reduces to $z I_0(z) \le 2(1+z) I_1(z)$. Letting
$x = z/2$ and rearranging terms, this is equivalent to

$$\sum_{k=1}^\infty \frac{x^{2k+1}}{(k-1)!\,(k+1)!} \le 2 \sum_{k=0}^\infty \frac{x^{2k+2}}{k!\,(k+1)!}.$$

This latter inequality follows easily by noting that each term on the left can
be bounded from above by two consecutive terms on the right, namely

$$\frac{x^{2k+1}}{(k-1)!\,(k+1)!} \le \frac{x^{2k}}{(k-1)!\,k!} + \frac{x^{2k+2}}{k!\,(k+1)!}$$

which results from the trivial inequality $kx \le k(k+1) + x^2$.

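Thus $h(z)$ is increasing and therefore it is bounded from above by its limit
$\ell = \lim_{z \to \infty} h(z)$. To prove that $\ell = \sqrt{\frac{2}{\pi}}$ one may use the known asymptotics
$\exp(-z)\sqrt{z}\, I_\alpha(z) \to \frac{1}{\sqrt{2\pi}}$ (see the cited reference, Chapter 9). Alternatively, one may use the
integral representation $I_n(z) = \frac{1}{\pi} \int_0^\pi \cos(n\theta)\, e^{z \cos\theta}\, d\theta$ to write

$$h(z) = \frac{\sqrt{z + 1/2}}{\pi} \int_0^\pi \Big[ 1 + \Big(1 - \frac{1}{2z}\Big) \cos\theta \Big] e^{-z(1 - \cos\theta)}\, d\theta.$$

Since $\frac{1}{2z}\sqrt{z + \frac{1}{2}} \to 0$ the relevant term for the limit is $\frac{\sqrt{z+1/2}}{\pi} \int_0^\pi [1 + \cos\theta]\, e^{-z(1-\cos\theta)}\, d\theta$,
which is transformed by the change of variables $z(1 - \cos\theta) = x^2/2$ into

$$\frac{2}{\pi} \sqrt{1 + \frac{1}{2z}} \int_0^{2\sqrt{z}} \sqrt{1 - \frac{x^2}{4z}}\; e^{-x^2/2}\, dx \ \longrightarrow\ \frac{2}{\pi} \int_0^\infty e^{-x^2/2}\, dx = \sqrt{\frac{2}{\pi}}.$$

□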



Remark. An alternative proof of the monotonicity of $h(z)$ is obtained by
substituting the well-known recurrence $I_{n+1} = I_{n-1} - \frac{2n}{z} I_n$ into the Turán-type
inequality $I_{n-1} I_{n+1} \le I_n^2$ (see [21]) which gives $I_{n-1}^2 - \frac{2n}{z} I_{n-1} I_n \le I_n^2$.
Denoting $x = I_{n-1}/I_n$ we have $x^2 - \frac{2n}{z} x \le 1$, and solving the quadratic we get
$x \le \frac{n}{z} + \sqrt{1 + (\frac{n}{z})^2}$. For $n = 1$ this last expression is smaller than $2(z+1)/z$
which gives $z I_0(z) \le 2(z+1) I_1(z)$ so that $h'(z) \ge 0$.
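The behaviour of $h(z)$ asserted in Proposition 9 can be observed numerically. The sketch below is illustrative only: `bessel_i` is a hypothetical helper that sums the power series of $I_0$ and $I_1$ given in the text, truncated at a fixed number of terms, which is assumed adequate for moderate $z$ (say $z \le 30$).

```python
from math import exp, pi, sqrt


def bessel_i(order: int, z: float, terms: int = 80) -> float:
    """Truncated power series of the modified Bessel function I_0 or I_1."""
    x = z / 2.0
    term = 1.0 if order == 0 else x            # k = 0 term of the series
    total = term
    for k in range(1, terms):
        term *= x * x / (k * (k + order))      # ratio of consecutive terms
        total += term
    return total


def h(z: float) -> float:
    """Expression (12): sqrt(z + 1/2) e^{-z} [I_0(z) + (1 - 1/(2z)) I_1(z)]."""
    return sqrt(z + 0.5) * exp(-z) * (
        bessel_i(0, z) + (1.0 - 1.0 / (2.0 * z)) * bessel_i(1, z))


grid = [0.1, 0.5, 1.0, 2.0, 5.0, 10.0, 20.0, 30.0]
h_values = [h(z) for z in grid]
```

On this grid the values increase and approach $\sqrt{2/\pi} \approx 0.7979$ from below. Near $z = 0$ the expression tends to $\frac{3}{4}\sqrt{\frac{1}{2}} \approx 0.5303$, which is the value of $R_1(p)$ for $p_1 = \frac{1}{2}$.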

2.4 Conclusion 

The bounds in §2.3 establish (6) and prove Theorem 1. Moreover, the bound (6)
is sharp and cannot be improved. Indeed, for $\alpha_i \equiv \alpha$ constant, setting
$u = 2\alpha(1-\alpha)$ and $S \sim B(n,u)$ we have

$$\sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)}\; P_n = \sqrt{nu/2}\; E[F(S)]$$

and by Proposition 8 this quantity converges to $\frac{1}{\sqrt{\pi}}$ as $n \to \infty$. This does
not mean that (3) is itself sharp since we only have $\|x_n - Tx_n\| \le P_n$. Thus,
a natural question is to find the smallest constant $\kappa$ for which (2) holds.
Although we do not know whether (3) is sharp or not, the following example
shows that this bound cannot be improved by more than 17%.

Example. Take $X = \ell^1(\mathbb{N})$ and let $C$ be the set of all sequences $x = (x^i)_{i \in \mathbb{N}}$
with $x^i \ge 0$ and $\sum_{i=0}^\infty x^i \le 1$, so that $\mathrm{diam}(C) = 2$. Let $T : C \to C$
be the right-shift isometry $T(x^0, x^1, \dots) = (0, x^0, x^1, \dots)$. Then, the
iteration (1) started from $x_0 = (1, 0, 0, \dots)$ generates a sequence of the
form $x_n = (p_n^0, p_n^1, \dots, p_n^n, 0, 0, \dots)$ with

$$p_n^i = P(X_1 + \dots + X_n = i)$$

where $X_i$ are independent Bernoullis with $P(X_i = 1) = \alpha_i$. It follows that

$$\|x_n - Tx_n\| = 2 \max\{p_n^i : 0 \le i \le n\}.$$
Now, consider $n = 2m$ Bernoulli trials, half of them with success probability
$\alpha_i = \frac{u}{m}$ and the other half with $\alpha_i = 1 - \frac{u}{m}$. Then

$$\max\{p_n^i : 0 \le i \le n\} \ge p_n^m = P(X = Y)$$

with $X, Y$ independent Binomials $B(m, \frac{u}{m})$. When $m \to \infty$ these Binomials
converge to Poissons so that $p_n^m$ tends to $\sum_{k=0}^\infty \big(\exp(-u) \frac{u^k}{k!}\big)^2 = \exp(-2u)\, I_0(2u)$.

Since $\sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)}$ tends to $\sqrt{2u}$, it follows that $p_n^m \sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)}$
can be made as close as desired to the value $\eta = \max_{x \ge 0} \sqrt{x}\, \exp(-x)\, I_0(x)$.
Hence the optimal $\kappa$ lies in the interval $[\eta, \frac{1}{\sqrt{\pi}}] \approx [0.4688, 0.5642]$ which leaves
a margin of at most 17%.
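Both ingredients of the example can be reproduced in a few lines. The sketch below is an illustration under stated assumptions: sequences in $\ell^1$ are stored as finite lists (trailing zeros omitted), the helper names are hypothetical, and $\eta$ is located by a crude grid search on $(0, 5]$ rather than by a proper optimizer.

```python
from math import exp, sqrt
from typing import List, Sequence


def km_shift_iterate(alphas: Sequence[float]) -> List[float]:
    """x_n for the right shift started at x_0 = (1, 0, ...); entries are p_n^i."""
    x = [1.0]
    for a in alphas:
        shifted = [0.0] + x                  # T x = (0, x^0, x^1, ...)
        x = [(1.0 - a) * xi + a * si for xi, si in zip(x + [0.0], shifted)]
    return x


def residual_l1(x: Sequence[float]) -> float:
    """||x - T x||_1 for the right shift T."""
    padded, shifted = list(x) + [0.0], [0.0] + list(x)
    return sum(abs(p - s) for p, s in zip(padded, shifted))


def scaled_i0(x: float, terms: int = 60) -> float:
    """exp(-x) * I_0(x), using the truncated power series of I_0."""
    term = total = 1.0
    for k in range(1, terms):
        term *= (x / 2.0) ** 2 / (k * k)
        total += term
    return exp(-x) * total


# Grid search for eta = max_x sqrt(x) exp(-x) I_0(x), step 0.001 on (0, 5].
eta = max(sqrt(k / 1000.0) * scaled_i0(k / 1000.0) for k in range(1, 5001))
```

The grid value of `eta` agrees with the lower endpoint 0.4688 quoted above to about three decimals, and `residual_l1(km_shift_iterate(alphas))` matches $2 \max_i p_n^i$ up to rounding.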



3 Two direct applications of Theorem 1

3.1 Convergence of the iterates 

The following result, which is basically known (cf. [15], [25]),
shows how Theorem 1 can be used to obtain the convergence of the iterates,
proving at the same time the existence of fixed points.

Proposition 10. Suppose $\sum \alpha_k(1-\alpha_k) = \infty$ and $x_k$ bounded.

(a) If $x_k$ is relatively compact then $x_k \to x$ for some $x \in \mathrm{Fix}(T)$.

(b) If $X$ is a Hilbert space then $x_k \rightharpoonup x$ weakly for some $x \in \mathrm{Fix}(T)$.

Proof. (a) Choose a convergent subsequence $x_{k_n} \to x$. From (3) we obtain
$x_k - Tx_k \to 0$ so that $x$ must be a fixed point. Since

$$\|x_k - x\| = \|(1-\alpha_k)(x_{k-1} - x) + \alpha_k (Tx_{k-1} - Tx)\| \le \|x_{k-1} - x\|$$

we conclude that $\|x_k - x\|$ decreases to 0.

(b) Since $I - T$ is maximal monotone and $x_k - Tx_k \to 0$, all weak cluster
points of $x_k$ belong to $\mathrm{Fix}(T)$. As before $\|x_k - x\|$ converges for all $x \in \mathrm{Fix}(T)$
so that weak convergence follows from Opial's lemma. □
3.2 Unbounded domains

When $C$ is unbounded (2) says nothing. However, if $\mathrm{Fix}(T)$ is nonempty (see
footnote 2), then for each $y \in \mathrm{Fix}(T)$ we may still apply (2) on the bounded subset
$\tilde{C} = C \cap B(y, \|y - x_0\|)$ which satisfies $T(\tilde{C}) \subseteq \tilde{C}$ and $\mathrm{diam}(\tilde{C}) \le 2\|y - x_0\|$.
Hence, setting $\bar\kappa = 2\kappa$ and taking the infimum over $y \in \mathrm{Fix}(T)$ we obtain

$$\|x_n - Tx_n\| \le \bar\kappa\, \frac{\mathrm{dist}(x_0, \mathrm{Fix}(T))}{\sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)}}. \tag{13}$$

In particular, Theorem 1 implies that (13) holds with $\bar\kappa = \frac{2}{\sqrt{\pi}} \approx 1.1284$.
In Hilbert spaces, [30, Vaisman] established a sharper bound with $\bar\kappa = 1$. We
present this result which exploits the well-known identity

$$\|(1-\alpha)u + \alpha v\|^2 = (1-\alpha)\|u\|^2 + \alpha\|v\|^2 - \alpha(1-\alpha)\|u - v\|^2. \tag{14}$$

Proposition 11. Let $T : C \to C$ be non-expansive on a convex $C \subseteq E$ with
$E$ a Hilbert space and $\mathrm{Fix}(T)$ nonempty. Then (13) holds with $\bar\kappa = 1$.

Proof. It is known that $\|x_k - Tx_k\|$ decreases with $k$. Indeed,

$$\begin{aligned}
\|x_k - Tx_k\| &= \|(1-\alpha_k) x_{k-1} + \alpha_k Tx_{k-1} - Tx_k\| \\
&\le (1-\alpha_k)\|x_{k-1} - Tx_{k-1}\| + \|Tx_{k-1} - Tx_k\| \\
&\le (1-\alpha_k)\|x_{k-1} - Tx_{k-1}\| + \|x_{k-1} - x_k\| \\
&= (1-\alpha_k)\|x_{k-1} - Tx_{k-1}\| + \alpha_k \|x_{k-1} - Tx_{k-1}\| \\
&= \|x_{k-1} - Tx_{k-1}\|.
\end{aligned}$$

Now, using (14), for each $y \in \mathrm{Fix}(T)$ we get

$$\begin{aligned}
\|x_i - y\|^2 &= \|(1-\alpha_i)(x_{i-1} - y) + \alpha_i (Tx_{i-1} - Ty)\|^2 \\
&= (1-\alpha_i)\|x_{i-1} - y\|^2 + \alpha_i \|Tx_{i-1} - Ty\|^2 - \alpha_i(1-\alpha_i)\|x_{i-1} - Tx_{i-1}\|^2 \\
&\le \|x_{i-1} - y\|^2 - \alpha_i(1-\alpha_i)\|x_{i-1} - Tx_{i-1}\|^2.
\end{aligned}$$

Summing these inequalities we see that

$$\sum_{i=1}^n \alpha_i(1-\alpha_i)\, \|x_{i-1} - Tx_{i-1}\|^2 \le \|x_0 - y\|^2 - \|x_n - y\|^2$$

and the monotonicity of $\|x_k - Tx_k\|$ yields

$$\|x_n - Tx_n\| \sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)} \le \|x_0 - y\|.$$

The conclusion follows by taking the infimum over $y \in \mathrm{Fix}(T)$. □

Remark: The previous proof yields a slightly sharper estimate

$$\|x_{n-1} - Tx_{n-1}\| \le \frac{\mathrm{dist}(x_0, \mathrm{Fix}(T))}{\sqrt{\sum_{i=1}^n \alpha_i(1-\alpha_i)}}$$

with $x_{n-1}$ in place of $x_n$ on the left.

2 A necessary and sufficient condition to have $\mathrm{Fix}(T) \neq \emptyset$ is that the iterate sequence
$\{x_k\}$ remains bounded (cf. [24]).
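The Hilbert-space bound (13) with $\bar\kappa = 1$ can be observed on a toy instance. The following sketch is only an illustration: it assumes the Euclidean plane, takes $T$ to be the rotation by 90 degrees (so that $\mathrm{Fix}(T) = \{0\}$ and $\mathrm{dist}(x_0, \mathrm{Fix}(T)) = \|x_0\|$), and requires every $\alpha_i$ to lie strictly between 0 and 1 so that the denominator is positive. The function names are hypothetical.

```python
from math import hypot, sqrt
from typing import List, Sequence, Tuple


def rotate90(x: Sequence[float]) -> List[float]:
    """Rotation by 90 degrees: a non-expansive map with Fix(T) = {0}."""
    return [-x[1], x[0]]


def hilbert_bound_rows(x0: Sequence[float],
                       alphas: Sequence[float]) -> List[Tuple[float, float]]:
    """Pairs (||x_n - T x_n||, dist(x_0, Fix T) / sqrt(sum a_i (1 - a_i)))."""
    x = list(x0)
    dist0 = hypot(x0[0], x0[1])      # distance from x_0 to the fixed point 0
    total = 0.0
    rows = []
    for a in alphas:
        Tx = rotate90(x)
        x = [(1.0 - a) * xi + a * ti for xi, ti in zip(x, Tx)]
        total += a * (1.0 - a)
        Tx = rotate90(x)
        rows.append((hypot(x[0] - Tx[0], x[1] - Tx[1]), dist0 / sqrt(total)))
    return rows


rows = hilbert_bound_rows([3.0, -4.0], [0.3, 0.7, 0.5, 0.2, 0.9] * 4)
```

In each row the first entry (the residual) should not exceed the second (the right-hand side of (13) with $\bar\kappa = 1$), and the residuals themselves should be non-increasing, as shown in the proof.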



4 Auxiliary results 

4.1 A Hoeffding-type inequality 

In this short section we establish a Hoeffding-type inequality for sums of
Bernoullis and Poisson variables. We consider an integer function $g : \mathbb{N} \to \mathbb{R}$
satisfying the convexity inequalities $g(k) \le \frac{1}{2}[g(k-1) + g(k+1)]$ for all $k \ge 1$.

Proposition 12. Let $S = X_1 + \dots + X_n$ be a sum of independent Bernoulli
trials with success probabilities $P(X_i = 1) = p_i$, and let $z = E(S) = p_1 + \dots + p_n$.
Then $E[g(S)] \le E[g(Z)]$ where $Z \sim P(z)$ is a Poisson with the same mean.

Proof. Let us first note that the expected value $E[g(S)]$ increases if we replace
any variable $X_i$ by a sum $X_i' + X_i''$ of independent Bernoullis with

$$P(X_i' = 1) = P(X_i'' = 1) = \frac{p_i}{2}.$$

Indeed, for $k \in \mathbb{N}$ let $A(k) = E[g(k + X_i)]$ and $B(k) = E[g(k + X_i' + X_i'')]$ so that

$$\begin{aligned}
A(k) &= (1 - p_i)\, g(k) + p_i\, g(k+1) \\
B(k) &= \big(1 - \tfrac{p_i}{2}\big)^2 g(k) + p_i \big(1 - \tfrac{p_i}{2}\big)\, g(k+1) + \big(\tfrac{p_i}{2}\big)^2 g(k+2).
\end{aligned}$$

Taking their difference we have

$$B(k) - A(k) = \big(\tfrac{p_i}{2}\big)^2 [g(k) - 2g(k+1) + g(k+2)] \ge 0$$

so that replacing $k$ by the random variable $\sum_{j \neq i} X_j$ and taking expectation
we obtain the asserted monotonicity.
Now, a well-known result by Hoeffding [16, Theorem 3] (see footnote 3) proves
$E[g(S)] \le E[g(S_1)]$ with $S_1 \sim B(n,p)$ a binomial with $p = \frac{1}{n}(p_1 + \dots + p_n)$.
Writing $S_1$ as a sum of $n$ Bernoullis $B(p)$ and sequentially replacing each
term by two Bernoullis $B(\frac{p}{2})$, the expected value increases in each step and
we get $E[g(S)] \le E[g(S_2)]$ with $S_2 \sim B(2n, p/2)$. Iterating this doubling
argument we obtain $E[g(S)] \le E[g(S_k)]$ where $S_k \sim B(2^{k-1} n, p/2^{k-1})$. Since
$E(S_k) = z$ for all $k$, the result follows by letting $k \to \infty$ and noting that $S_k$
converges to a Poisson variable $Z \sim P(z)$. □
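A quick numerical comparison of the two sides of Proposition 12 can be made as follows. This is a sketch under stated assumptions: the Poisson expectation is truncated at a fixed cutoff (assumed harmless for small means), the helper names are hypothetical, and the convex function used is the $g$ from Section 2.3.2.

```python
from math import comb, exp
from typing import Callable, Sequence


def F(m: int) -> float:
    return comb(m, m // 2) / 2 ** m


def g(k: int) -> float:
    """Convex integer function from Section 2.3.2: g(k) = [F(k) + F(k+1)] / 2."""
    return 0.5 * (F(k) + F(k + 1))


def expect_bernoulli_sum(probs: Sequence[float],
                         fn: Callable[[int], float]) -> float:
    """E[fn(S)] for S a sum of independent Bernoullis, by convolution."""
    pmf = [1.0]
    for p in probs:
        nxt = [0.0] * (len(pmf) + 1)
        for m, q in enumerate(pmf):
            nxt[m] += (1.0 - p) * q
            nxt[m + 1] += p * q
        pmf = nxt
    return sum(fn(m) * q for m, q in enumerate(pmf))


def expect_poisson(z: float, fn: Callable[[int], float],
                   cutoff: int = 200) -> float:
    """E[fn(Z)] for Z ~ Poisson(z), truncated at `cutoff` terms."""
    weight, total = exp(-z), 0.0
    for k in range(cutoff):
        total += fn(k) * weight
        weight *= z / (k + 1)        # Poisson pmf recurrence
    return total


probs = [0.5, 0.1, 0.3, 0.45]
lhs = expect_bernoulli_sum(probs, g)
rhs = expect_poisson(sum(probs), g)
```

Here `lhs` should not exceed `rhs`. The same comparison can be repeated with any other convex `fn`, for instance `lambda k: (k - 2) ** 2`, where both sides are available in closed form.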



3 As a matter of fact, Hoeffding assumes $g$ strictly convex but the general case follows
by applying his result to $g(x) + \epsilon x^2$ with $\epsilon \downarrow 0$.

4.2 An identity for Catalan numbers

In proving Proposition 8 we used the identity

$$C_k = \sum_{j=0}^k (-1)^j\, 2^{k-j} \binom{k}{j} \binom{j}{\lfloor j/2 \rfloor}.$$

Since this is not found in standard textbooks, for completeness we provide a
proof. For each $a \in \mathbb{Z}$ and $P(x)$ a Laurent polynomial (i.e. a function whose
Laurent series has finitely many terms) we denote by $[x^a]P(x)$ the coefficient
of $x^a$ in $P(x)$. We observe that for each non-negative integer $j$ we have

$$[x^0](x^2 + x^{-2})^j = \begin{cases} \binom{j}{j/2} & \text{for } j \text{ even} \\ 0 & \text{for } j \text{ odd} \end{cases}
\qquad
[x^2](x^2 + x^{-2})^j = \begin{cases} 0 & \text{for } j \text{ even} \\ \binom{j}{(j-1)/2} & \text{for } j \text{ odd} \end{cases}$$

so we can write $\binom{j}{\lfloor j/2 \rfloor} = ([x^0] + [x^2])(x^2 + x^{-2})^j$ and therefore

$$\begin{aligned}
\sum_{j=0}^k (-1)^j\, 2^{k-j} \binom{k}{j} \binom{j}{\lfloor j/2 \rfloor}
&= ([x^0] + [x^2]) \sum_{j=0}^k \binom{k}{j} 2^{k-j} (-x^2 - x^{-2})^j \\
&= ([x^0] + [x^2])\, (2 - x^2 - x^{-2})^k \\
&= ([x^0] + [x^2])\, \big(-(x - x^{-1})^2\big)^k \\
&= ([x^0] + [x^2])\, (-1)^k (x - x^{-1})^{2k} \\
&= \binom{2k}{k} - \binom{2k}{k+1} = C_k.
\end{aligned}$$
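The Catalan identity above involves only integers, so it can be verified exactly for small $k$. The following check is a minimal sketch with hypothetical function names.

```python
from math import comb


def catalan(k: int) -> int:
    """C_k = binom(2k, k) / (k + 1)."""
    return comb(2 * k, k) // (k + 1)


def alternating_sum(k: int) -> int:
    """sum_{j=0}^k (-1)^j 2^(k-j) binom(k, j) binom(j, floor(j/2))."""
    return sum((-1) ** j * 2 ** (k - j) * comb(k, j) * comb(j, j // 2)
               for j in range(k + 1))


def ballot_form(k: int) -> int:
    """binom(2k, k) - binom(2k, k+1), the last line of the derivation."""
    return comb(2 * k, k) - comb(2 * k, k + 1)


checks = [(catalan(k), alternating_sum(k), ballot_form(k)) for k in range(25)]
```

Each triple in `checks` should consist of three equal integers; this confirms the identity for those values of $k$ but is of course not a substitute for the proof.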